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Abstract: The democracy-science relationship has traditionally been examined through 
philosophical conjecture and single country case studies. There remains limited global scale 
empirical research on the topic. This study explores country level factors related to the dynamics 
of the global scientific research collaboration network, focusing on structural associations 
between democratic governance and the formation, persistence, and strength of international 
research collaboration ties. This study combines longitudinal data between 2008 and 2017 from 
the Varieties of Democracy Institute, World Bank Indicators, Scopus, and Web of Science 
bibliometric data. Methods of analysis include temporal and weighted exponential random graph 
models (ERGM). The results suggest positive significant effects of both democratic governance 
on international research collaboration and homophily between countries with similar levels of 
democratic governance. The results also show the importance of exogenous economic, 
population, and geo-political factors, as well as endogenous network factors including 
preferential attachment and transitivity. 
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INTRODUCTION 

Knowledge of the relationship between country-level governance and processes and outcomes of 
national scientific performance remains limited. Despite a wealth of country level data regarding 
political, economic, and bibliometric indicators, few studies have examined the association 
between country level factors and research collaboration or the evolution of international 
collaboration networks. As such, the relationship between structural factors and collaborative 
research that spans national borders remains somewhat opaque. Robust research programs that 
measure indicators of democratic governance (Coppedge et al., 2011, 2022) as well as indicators 
of scientific research activity (Fortunato et al., 2018) have developed in parallel but remain 
relatively disconnected. 

Addressing questions of governance are important for a full understanding of the 
antecedents of scientific collaboration. If, as scholars like Karl Popper and Robert Merton 
suggested, democracy is the optimal system of governance for the flourishing of science, then a 
large portion of the world’s human capital remains untapped. The vast majority of the world’s 
population currently lives under closed or electoral autocracies (Boese et al., 2022). Yet, outlier 
countries such as China have made major advancements in international science in recent 
decades, despite remaining one of the most autocratic countries in the international system (Zhou 
and Leydesdorff, 2006; Cao et al., 2020). Further, if democracy is optimal for science, then the 
recent general trend of democratic backsliding and rising authoritarianism makes the question 
more salient (Boese et al., 2022). Global democratization peaked in 2012 and has since reverted 
to 1989 levels, erasing the last 30 years of progress. Further, scientists and scholars are 
increasingly subject to repression by “autocratizing” nations and academic freedom is declining 


globally (Grimm and Saliba, 2017). A philosophical assumption of this study is that international 


collaboration in science is not adequately explained by reference only to facts about individuals 
or institutions, but rather that a nation’s overarching governance structures enable, constrain, and 
influence the self-organization among scientists and scholars (Bhaskar, 2014; Wagner and 
Leydesdorff, 2005). 

The current study builds on research concerned with the relationship between country 
level governance factors and innovation in science and technology (Taylor, 2016; Whetsell et al., 
2021). The study tests hypothesized effects of democratic governance on the probability of 
forming, maintaining, and strengthening research ties in the global network of international 
science. While there are numerous studies spanning disciplines that analyze bibliometric 
indicators of scientific research, from inputs such as inclusion and inequality (Kozlowksi et al., 
2022) to outcomes such as citation impact (Waltman, 2016), the current study leverages 
bibliometric data to construct and analyze a global network of international research 
collaboration ties and its relationship to liberal democracy. This article also builds on research 
that examines cross-governance effects on international cooperation (Kinne, 2013; Gallop, 2016) 
to better understand assortative mixing (homophily) between democracies and autocracies. 
Finally, this article examines more complex processes inherent in a wide array of networks, such 
as preferential attachment and transitivity, which are important processes that manifest in wide 
array of social networks (Barabasi and Albert, 1999, 2002; Newman, 2003a). 

Inferential network analysis techniques are applied to test the hypotheses while 
accounting for the dyadic dependency inherent in network tie formation (Cranmer et al., 2020). 
These tools provide novel techniques for analyzing international collaboration growth suitable to 
the relational structure of the data. While previous studies at the international level have used 


individual level (country level) indicators, this study leverages more recent developments in 


network science to account for endogenous processes inherent in relational co-authorship data. 
To this end, separable temporal exponential random graph models are used to analyze effects on 
the formation and persistence of ties over time, while valued exponential random graph models 
are used to analyze the strength of ties between countries in the international collaboration 
network. Hypotheses are tested using longitudinal data from the Varieties of Democracy 
Institute, World Bank Indicators, and Scopus and Web of Science bibliometric data from 2008 to 


2017. 


LITERATURE REVIEW AND HYPOTHESIS DEVELOPMENT 

The literature review derives hypotheses by synthesizing inter-disciplinary insights from 
disparate literatures across scientometric studies of international research collaboration, 
philosophical scholarship on the nature of science, and political science studies of democracy. 
Given the lack of extant theory on the direct topic, it is necessary to review these literatures in 
order to connect the dots regarding the relationship between national governance structures and 
international collaboration patterns in scientific research. 

International Research Collaboration. The topic of research collaboration as a subject of 
inquiry in science policy is wide ranging and well developed. The term “invisible college” 
captures the early roots of scientific collaboration, referring to a group of seventeenth century 
scientists who organized activities informally through the collaborative exchange of letters, 
which later formed into the Royal Society of London. Derek de Solla Price (1963) documented a 
steady acceleration of coauthored papers in the early 20" century, arguing that science had 
moved from “little science” in the seventeenth century to a contemporary form of “big science” 


involving national expenditures, economic impacts, computers, rockets, and the Manhattan 


Project. One of the most astonishing examples of contemporary international research 
collaboration includes the Atlas Experiment in particle physics, which produced a series of 
publications with thousands of authors from dozens of countries (Castelvecchi, 2015). In the 
contemporary setting, the “new invisible college” refers to a sprawling global network of authors 
connected by papers rather than institutions (Wagner, 2009; Wagner and Leydesdorff, 2005; 
Wagner et al., 2017). The growth and density of the international collaboration network 
continues to increase (Wagner et al., 2015). However, relatively few scholars have questioned 
whether and how the overarching governance structures of nations influence self-organization 
among scientists in the international system. 

Numerous factors are implicated in the dramatic evolution of research collaboration as a 
mode of scientific inquiry over the past century. The costs of cooperation have declined 
substantially, particularly given the rise of the internet, email, and video conferencing (Landry 
and Amara, 1998; Huang et al., 2015). The steadily increasing specialization of scientific 
disciplines (Wray, 2005), as well as the evolution of multidisciplinary research projects (Wagner 
et al., 2011) have also contributed. The complexity, scope, and magnitude of scientific programs 
has increased, requiring participation by numerous researchers to execute single experiments, 
particularly those involving complex and expensive equipment (Katz and Martin, 1995). Further, 
the institutional norms of higher education institutions, as well as governmental organizations, 
have adjusted to accommodate and promote collaborative teams (Defazio et al., 2009). 
Meanwhile, scholars themselves have adjusted to the new norm, realizing gains in “scientific 
human capital” and productivity through collaboration domestically and across national borders 
(Bozeman et al., 2001; Melkers and Kiopa, 2010). The journey toward promotion and tenure 


through the traditional route of “publish or perish” may be less perilous with collaborative 


partners shouldering a part of the burden of knowledge creation. Lastly, collaboration facilitates 
the friendship, career advancement, learning (Dusdal and Powell, 2021), and epistemic goals of 
scientists (Wray, 2002). 

Abramo et al. (2008) showed that the association between research collaboration and the 
scientific productivity of universities varies substantially across disciplines, between basic and 
applied science, and in the context of domestic versus international collaboration. Recently, Fox 
and Nikivineze (2021) showed that collaboration is an important predictor in prolific publication 
productivity in science. International research collaboration, as distinct from domestic 
collaboration, may have even greater returns. As Glanzel and de Lange (2002) showed, 
internationally coauthored papers tend to have higher citation impact than domestically 
published papers. Reversing the question, Abramo et al. (2011) tests whether productivity of the 
scientist predicts higher level of international research collaboration. Their findings indicate 
more productive scientists tend to be internationally collaborative. Thus, there appears to be an 
endogeneity/chicken-egg problem at work, where collaboration begets performance begets 
collaboration. However, the literature has repeatedly shown that international research 
collaboration results in higher productivity and performance (Van Raan, 1997). Wagner et al. 
(2017) questioned whether international collaboration tended to produce conventional, novel, or 
atypical research, finding, contrary to expectation, that conventional research tended to be the 
most typical outcome. Using a different measure of internationalization in research, Sugimoto et 
al. (2017) and find that mobile scientists with many country affiliations in their publication 
record tend to build up international collaboration networks as they travel within the 


international system and also tend to have the highest citation impact. Similarly, Robinson- 


Garcia et al. (2019) found that mobile scientists tend to have higher publication output and 
citation impact. 

Fewer studies have examined the effects of national level characteristics. Glanzel and 
Schubert (2005) suggest that larger countries, such as India, Japan, United States, and Turkey 
tend to produce a higher proportion of domestic publications than smaller countries. Conversely, 
smaller or more remote countries, such as Switzerland, Austria, South Korea, tend to have higher 
international rates of publication. Leydesdorff et al. (2019) recently examined whether the 
influence of government funding or international research collaboration effects a nation’s 
citation impact finding the balance favors international research collaboration. 

Finally, Subramanyam (1983) suggested coauthorship as an indicator of collaboration has 
its advantages as a measure, being non-reactive, invariant, quantifiable, and inexpensive. 
However, Katz and Martin (1997) cautioned against reading too much into the term 
‘collaboration’. As Davidson and Carpenter (1979) suggest there are also ambiguities that 
should be appreciated in international collaboration, such as single authorship with multiple 
affiliations, but also suggested such errors are small relative to the size of bibliometric databases. 
Thus, international coauthorship may not always reflect true ‘collaboration’. Rather, the 
approach of this study is that the term ‘international research collaboration’ refers minimally to 
instances in which two or more institutional affiliations referring to distinct countries co-occur 
on a single research paper. While this operationalization is not perfect, global scale analysis 
would not be possible assuming a more discriminating investigation into the ‘real’ versus 
superficial collaborations possible by closer evaluation at the paper level. 

Governance Effects on Innovation in Science and Technology. The rise of China in the 


international system of science increases the salience of questions regarding democratic 


governance and scientific capacity. These are traditionally assumed to be complementary if not 
critically compatible. But few large-scale empirical studies have examined the general effect of 
governance on scientific processes and outcomes, or on how the relationship might influence 
international research collaboration. Scholars suggest that social structure influences 
organizations and individuals, who themselves reproduce and sometimes transform the structures 
within which they are embedded (Whitley, 2000; Bhaskar, 2014). However, while a large body 
of research has focused on the individual, the discipline, and the institution, the effects of 
political governance structures on the self-organization of scientific praxis is not empirically well 
established. 

The conjecture that democracy is the optimal political environment for the advancement 
of science has existed since both popular sovereignty and natural philosophy have co-occurred 
under the same jurisdiction of governance. In the classic two volume work, The Open Society 
and its Enemies, the philosopher of science Karl Popper (2020) defends liberal democracy 
against both fascism and communism by disarticulating the ideological commitments to 
historicism found in the latter two systems. The notion that history follows pre-determined 
trajectory toward some final, perfected, and inevitable political end-state is antithetical to both 
the notion of political liberalism and to the necessity for a free and open society. Parsons (1951) 
suggested that a society’s social structures shape its scientific development, while Barber (1953) 
suggested that the values of democracy and science are compatible with each other and 
incompatible with autocratic rule. Merton (1938, 1973) suggested that a scientific enterprise 
concerned with the pursuit of impersonal truths should be unfettered by ideological dogma, is 


consonant with democratic governance, and clashes with totalitarianism. Work in this conceptual 


vein suggests a fundamental compatibility between the normative elements of democracy and 
science. 

As Taylor (2012) suggested, “[i]nnovation is inherently political in that it involves highly 
contested decisions over the allocation of resources, institution and policy design, and the 
formation and maintenance of domestic and international political economic networks” (p.150). 
Wang et al. (2021) conducted a test of Popper’s hypothesis regarding positive effects of 
democracy on innovation, using patents and trademarks. They show significant positive effect of 
democracy on innovation across a variety of empirical tests with the greatest returns occur for 
countries with lower levels of economic development. Similarly, Schmid et al. (2017) show that 
democratic institutions are associated with patent production. These results are contradicted by 
Gao et al. (2017), who test the hypothesis on a subset of patents. Thus, the results of this 
literature are somewhat mixed. While these observations refer primarily to technology, it is 
reasonable to suggest that political governance also has significant effects on scientific 
innovation. In one empirical article specifically on the topic, Whetsell et al. (2021) recently 
provided evidence in support of the democracy-science hypothesis, showing positive significant 
effects of democratic governance on scientific performance. They suggested that democratic 
governance might enhance the internal and external complexity of the country, such that science 
is provided wider latitude for self-organization on intellectual fitness landscapes than 
autocratically governed countries (Wiesner et al. 2018; Eliassi-Rad 2020). International 
collaboration as a type of external complexity may produce a greater variety of conventional, 
novel, and atypical ideas (cf. Uzzi et al., 2013; Wagner et al., 2019). Whetsell et al. (2021) also 
showed a negative interaction effect between democracy and international collaboration 


intensity, where the effect of democracy on scientific performance diminished for higher levels 


of international collaboration intensity. This counter-intuitive findings incites further research 
into the effects of democracy on international research collaboration. 

Democracy and Academic Freedom. To more clearly articulate a potential causal 
pathway from political governance to collaborative self-organization, a newly developing 
empirical literature on democracy and academic freedom is briefly reviewed. Coppedge et al. 
(2011; 2021, pp. 44-45) suggest liberalism is based on a negative concept of liberty that is 
realized when the political power of the executive is constrained by effective checks and 
balances, the rule of law, constitutionally protected civil liberties, and an independent judiciary. 
At the same, liberalism must be combined with a structural view of electoral democracy, under 
which suffrage must be extensive, there must be freedom of association and expression, and 
there are clean elections for public officials. This structural element is based on Dahl’s (1971, 
1989) concept of polvarchy. However, liberal democracy is not the system of government for the 
majority of the world. Roughly 70% of the global population live under closed or electoral 
autocracies (Boese et al., 2022; Coppedge et al., 2022). As Liihrmann and Lindberg (2019) 
suggested, the world is experiencing a “third wave” of autocratization. Further, as several 
observers have noted, the latest wave of autocratization is less blunt and more sophisticated. In 
other words, autocratization is now less likely to occur as a result of an outright military coup 
and more likely to involve legalistic machinations, misinformation/disinformation campaigns, 
toxic political polarization, and sham elections (Boese 2022). As Hellmeier et al. (2021) point 
out, autocratizing countries appear to follow a predictable pattern which typically begins by 
repressing academic freedom, media, and civil society. For instance, Enyedi (2018) details the 


methods by which academic freedoms have been eroded in recent years in Hungary coinciding 


10 


with its descent into electoral autocracy. According to theory, autocratization of nations should 
hinder the self-organization of scientists within and across national borders. 

In summary, theory suggests that liberal democracy provides the optimal conditions for 
the self-organization of science as a whole through the development of academic freedom, where 
members of the academic community retain individual and collective rights to develop and 
communicate knowledge, are autonomous from the state and other political forces, and are free 
to cooperate with others across boundaries and borders (Fernando 1989). Berggren and 
Bjørnskov (2022) show that democratic political institutions are a significant predictor of 
academic freedom. Thus, there is a clear connection between democratic governance and the free 
conduct of science. In liberal democracies, where scientists are permitted and encouraged to 
freely associate with others across ideological, political, and national borders, we should expect 
greater levels of international research collaboration. In more autocratic systems, where 
academics are censored, monitored, and their movements are restricted, we should expect lower 
levels of international research collaboration. As such, the following hypothesis is warranted. 

Hypothesis 1: Democratic governance is positively associated with international 
research collaboration a) tie formation, b) tie persistence, c) and tie strength in the 


global scientific collaboration network. 


Assortative Mixing Effects of Governance Structure. Assortative mixing refers to a 
process within networks in which actors tend to form connections with actors of a similar 
(homophily) or dissimilar (heterophily) characteristic (Newman 2003b). When actors in a 
network form social ties on the basis of similarities, such as race and ethnicity, religion, group 


membership, or any other construct real or imagined, this is often referred to as homophily in the 
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social network literature. A common metaphor used to describe this process is “birds of a feather 
flock together” (McPherson et al., 2001). In contrast to homophily, when actors in a social 
network form connections based on complementarity or dissimilarity on a given attribute this is 
referred to as heterophily, and includes examples such heterosexual mate selection, mentees 
seeking out older “wiser” mentors, or constructing skill complementarity in team structures. In 
the context of the present study, the question may be formulated as follows. Do countries with 
similar governance structure tend to collaborate more with each other? 

The question of democratic collaboration is perhaps best situated within a stream of 
international relations theory loosely referred to as liberal institutionalism. In contrast to realist 
and neorealist theories of international relations which deprioritize the internal characterization 
of a nation’s governance structures and instead prioritize its military capabilities in balance with 
those of respective competitors (e.g. Mearsheimer, 2001; Waltz, 2018), liberal institutionalism 
stresses the importance of reducing the transaction costs of cooperation through the construction 
of international regimes (e.g. Keohane, 2005). As the logic goes, conflict can be reduced by 
erecting international institutions, democracies tend to engage in institution building, therefore 
democracies tend to cooperate more with each other and have less conflict. This vein of theory is 
more broadly known as the “democratic peace”, the “liberal peace”, or the “Kantian peace”. 
Within the framework of liberal institutionalism, Wagner’s (2009) “new invisible college”, as a 
global network of science, might be considered a type of international institution that promotes 
international cooperation and generally reduces the prospect of conflict between nations. These 
broader effects however are beyond the scope of this article. As Wagner et al. (2015) suggested, 


the global network of science should be conceptualized as a new type of institution. It is 
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reasonable then to question whether democracies may be more engaged in homophilous 
collaboration in this new institution. 

The broad question posed in this article, how do the domestic characteristics of a country 
affect its tendencies to engage in international research collaboration, is quite similar to a 
question that political scientists have been interested in for decades. Lai and Reiter (2000) 
argued, “the central question of international relations in the past decade has been, What are the 
connections between domestic and international politics”(p. 204). Leeds (1999) argued that 
democratic leaders, by virtue of their accountability, are more likely to make and uphold credible 
foreign policy commitments. According to this theory, democracies might be viewed as more 
reliable partners for international cooperation and may also less likely to form weaker 
agreements with other non-democratic countries. As Gallop (2017) suggests, “it seems both 
plausible and empirically justified that states should gain somewhat more utility when they 
cooperate with like regimes” (p.315). The logic of international cooperation based on regime 
similarity appears to be isomorphic with the research question of international research 
collaboration based on governance structure. However, an important difference here is that the 
interactions between countries discusses above tend to be formal agreements between 
governments, while international collaboration is the outcome of self-organization of scientific 
activity. 

As Mansfield et al. (2002a, 2002b) show, countries with democratic governance are more 
likely to cooperate on trade policy. In contrast, Lai and Reiter (2000) provided evidence against 
the hypothesis that democracies tended to engage in more formal alliances with each other. 
However, they did show that democracies were more likely to form defense pacts after 1945, and 


that, more broadly, countries with similar regime type are more likely to cooperate with one 
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another in the international system. Using inferential network analysis techniques on a data set of 
international agreements between WWII and 1980, Kinne (2013) shows that international 
cooperation is significantly driven by endogenous network effects, such as preferential 
attachment and transitivity. He also shows significant positive effects of democracy, which 
appear to overwhelm the effects of military capabilities on military agreement formation. 
However, interestingly he shows a negative direct effect on formal science agreements during 
this time period and a non-significant and positive homophily effect. In a subsequent study, 
Kinne (2014) shows positive significant direct effects and homophily effects of democracy on 
diplomatic tie formation. In a similar vein, Warren (2016) also uses inferential network analysis 
to show that democracies tend to form alliance ties with one another. 

Hypothesis 2: Countries with democratic governance are more likely to a) form, b) 

maintain, and c) strengthen international research collaboration ties with each other 


than with autocratic countries, and vis versa. 


METHODS 

Modeling Approach: This article employs inferential social network analysis techniques in the 
family of models referred to as exponential random graph models (ERGM) (Wasserman and 
Pattison, 1996; Robbins et al., 2007; Lusher et al., 2013). Exponential random graph models are 
appropriate to account for dyadic dependencies found in relational international collaboration 
data. While, there are numerous studies on international collaboration, few have employed such 
methods. In a nutshell, the ERGM provides estimates on parameters of interest by simulating a 
probability distribution of thousands of networks then comparing elements of the observed 


network to the simulated distribution. In addition to including exogenous parameters of interest, 
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ERG models also include endogenous terms that account for complex dependencies in the data, 
such as density, preferential attachment, and transitivity, which are known to affect tie formation 
in networks. Two specific types of ERG model are used in this article disused below, valued 
exponential random graph models (VERGM) and separable temporal exponential random graph 
models (STERGM). 

First, STERGMs are used to account for the longitudinal nature of the network data with 
discrete yearly networks (Hanneke et al., 2010; Krivitsky and Handcock, 2014). Scholars have 
previously used these models on longitudinal bibliometric and scientometric data (Fagan et al., 
2018). The STERGM approach may be preferable to the stochastic actor oriented (SAOM) 
model in this specific case, because coauthorship ties are non-durational instances measured 
temporally at the year in which the paper was published. Thus, there is no ‘actor-oriented’ 
decision to dissolve the tie in the following year, which the SAOM model assumes (Leifeld and 
Cranmer, 2019). Rather, in this case, tie persistence between countries is a function of new 
papers continually being published every year. STERGMs produce probability estimates on 
parameters of interest for the both the formation and dissolution of ties in the network 
simultaneously, where dissolution of ties is interpreted as ‘persistence’. STERGMs assume that 
the formation of a tie and the dissolution of a tie are ‘separable’, independent of each other in a 
given instance. In other words, a coauthorship tie can form between authors in two countries 
while one or both can still form ties with other countries without dissolving the original tie. The 
conditional maximum likelihood estimation (CMLE) option was used, as the data represent a 
network panel rather than a single network with tie duration information. For an in-depth 


discussion of the theory behind the method see Krivitsky (2014). For more information on the 
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implementation of the method in the R package ‘tergm’, see Carnegie et al. (2015) and Krivitsky 
and Handcock (2021). 

Second, there are limited options available to analyze edge weights in valued networks. 
Because the ties in the international research collaboration network are collapsed on countries as 
nodes, ties are stacked according to the number of instances where countries have authors 
collaborating with each other in a given year. These frequencies are referred to as tie weights or 
valued ties in a weighted/valued network. Valued networks remain less well understood and 
developed, and also pose greater computational challenges than binary ERGMs (Huang and 
Butts, 2021). Often, such networks are simply binarized and analyzed using standard ERG 
models (Cranmer and Desmarias, 2011). More recently, valued ERGMs (VERGMs) have been 
advanced and implemented in the R package ‘ergm.count’ to analyze weighted networks 
(Krivitsky, 2012; Krivitsky and Butts, 2013), which is part of the broader R package 
“statnet’ (Handcock et al. 2008). Rather, than focusing on the probability of a tie forming between 
two nodes, VERGMs produce estimates based on the strength of ties. Thus, VERGMs are 
suitable to addressing the hypothesis regarding the strength of research collaboration ties in the 
international system. As Huang and Butts (2021) suggested, there is an ‘acute’ need for analysis 
of tie strength of social connections between nations. Further, in order to test for effects over 
time, the VERG models are replicated for each network yearly cross section across the full time 
period. More details on the implementation of the models are described below in the results 
section. Model diagnostics are presented in the Appendix. 

Data and Variables: This research combines data from numerous sources. First, 
international research collaboration metadata were gathered from Indiana University’s CADRE 


portal containing the Web of Science database on all articles and reviews for the years 2008 to 
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2017 (Mabry et al., 2020). The metadata includes author-country affiliations for individual 
papers. These data are aggregated at the country level, where the paper represents a network 
tie(s) between two or more affiliated countries. Aggregating at the country level produces a 
valued meta-network where all ties between the countries are aggregated as frequency weights 
on the network ties. Aggregating at the country-level avoids well-known problems associated 
with author disambiguation (c.f. Gurney et al., 2012). Since there are already a large proportion 
of binary research collaboration ties between many countries in the international system, much of 
the variability in the data resides in the tie weights in addition to binary tie occurrence. The data 
were used to produce symmetrical, undirected, valued matrices for each year between 2008 and 
2017. The exact number and order of nodes across years was maintained for comparability 
between years using the final year as the node set, resulting in a sample of 200 countries. 
However, after eliminating country column/rows for those with missing data in the primary 
independent variable, the final node sample included 169 countries. 

The primary independent variable of interest is the Varieties of Democracy Institute’s (V- 
Dem) liberal democracy index (Coppedge et al., 2011; Coppedge et al., 2021). The V-Dem data 
represents the most comprehensive and current characterization of democracy across nations in 
the international system and is more granular than other extant measures such as the Economist 
Democracy Index or the Polity project. The liberal democracy index combines an index of 
liberalism with an electoral democracy index. The liberal component index is composed of three 
sub-components: 1) equality before the law and individual liberties, 2) judicial constraints on the 
executive, and 3) legislative constraints on the executive. The electoral democracy index is based 
on Robert Dahl’s (1971, 1989) characterization of polyarchy, and is composed of five democracy 


components measuring 1) freedom of association, 2) free and fair elections, 3) freedom of 
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expression and alternative sources of information, 4) elected official, 5) and suffrage (Coppedge 
et al., 2021). The index is standardized on a 0-1 scale, where higher on the scale represents a 
greater realization of the “ideal” of liberal democracy. 

This research employed a select number of control variables to account for systematic 
differences between countries that might also drive collaborative tie formation. These include 
three measures from World Bank Indicators, including gross domestic product per capita, the log 
of population size, and urbanization. The inclusion of these variables as controls is relatively 
straightforward. The wealth and population size of the nation likely enhances its ability to 
participate in a variety of collaborative scientific activities, while urbanization may reduce the 
need to collaborate externally due to a concentration of domestic collaboration partners. The 
number of authors aggregated at the country level was used to control for the higher probability 
of international collaboration due simply to the number of scholars available for a collaboration. 
This variable may provide a more granular control than other commonly used measures such as 
number of Ph.D. granting institutions or secondary school enrollment. These data were captured 
separately from Elsevier’s Scopus data. Other control variables were considered such as 
R&D/GDP and GERD. However, the data coverage for these variables is far smaller and 
therefore were not included due to the broad nature of the hypotheses. Finally, region was 
included as a fixed effect to account for proximity and political effects of geography by using the 
V-Dem data’s geo-political region variable, which includes ten regions. 

Missing Data: Some variables had missing data. ERG models require no missing data on 
node covariates. Therefore, a basic imputation strategy was taken. Median imputation was used 
for highly skewed distributions and mean was used for approximately normally distributed 


variables. For four countries the GDP was imputed if values were available from adjacent years, 
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using the mean of the available values. For two remaining countries, GDP was median imputed 
based on data from other countries. For population size, one country had half of its time values 
missing, therefore the remaining values were imputed based on the last available year. One other 
country was median imputed for the full time period. The same was done for one country on 
urbanization, and one without was median imputed. 

Endogenous Controls: Network models in the exponential family offer a wide variety of 
endogenous terms which model the various sources of dependency in relational data. Over the 
past decade or so of research using ERGMs, a more or less standard set of terms tend to be used 
to account for density, preferential attachment, reciprocity, and transitivity. For the STERGM 
model, the ‘edges’ term accounts for tie formation and persistence driven by network density, 
geometrically-weighted degree distribution (GWdegree) accounts for preferential attachment 
where a negative sign indicates its presence. Geometrically-weighted edge-wise shared partner 
distribution (GWESP) and geometrically-weighted dyad-wise shared partner distribution 
(GWDSP) account for various types of transitivity (friend-of-a-friend dynamic) in social 
relations. For the VERGMs, a different logic is applied to model the sample space. While the 
terms are not perfectly synonymous, the ‘sum’ term is used to account for the endogenous effect 
of tie strength, ‘nodesqrtcovar’ is use to account for actor heterogeneity, and ‘transitive.weights’ 
is used to account for transitivity with the ‘min’,’max’,’min’ specification. Following, Pilney and 
Atouba (2018) the latter two are interpreted similarly to the binary ERGM terms GWdegree and 


GWESP. 
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RESULTS 

The results are presented as follows. First, visualizations of the networks are presented to 
illustrate the evolution of the network over the time period. Second, descriptive statistics on the 
structure of the whole-network are presented for each year. Third, temporal exponential random 
graph models are presented that show tests of hypotheses regarding the formation and 
persistence of international research collaboration ties. Finally, weighted exponential random 


graph models are presented showing the hypotheses tests regarding the strength of these ties. 


Figure 1. International Collaboration Networks 2008 to 2017 — a table is also needed 


z 
A 


\ 


Figure Notes: Top row is networks 2008 through 2012, bottom is 2013 through 2017. The networks are colored 
according to their democracy score, where dark red is lowest and dark green is highest. Nodes are sized by node 
degree centrality. 


Figure 1 shows the international collaboration networks by year on the top row from 
2008-2012 from left to right and 2013-2017 on the bottom row. The nodes are colored by the 
liberal democracy index, where darkest green is highest democracy and darkest red is lowest 
democracy. Isolates are removed from the figure for visualization purposes. The Kamada-Kawai 
algorithm (Kamada and Kawai, 1989) was used as the network layout, nodes are sized by degree, 


and the R package ‘GGally’ was used to visualize the network (Tyner et al., 2017). The networks 
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clearly show that countries scoring lower on the liberal democracy index tend to occupy 
peripheral positions in the network, while those scoring higher tend to occupy the core of the 
network. 

Table 1 shows the whole network descriptive statistics for each year of the edge-weighted 
network from 2008 to 2017. The total number of nodes is the same each cross section. The total 
number of edges, total edge weight, and maximum edge weight increase continuously over time. 
The number of disconnected components decreases over time. One very striking finding is the 
substantial increase in the density of the network over the time period, as gradually it appears all 


countries may be connected to each other at some point in the near future. 


Table 1 — Whole Network Descriptive Statistics, 2008 to 2017 
2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 


nodes 169 169 169 169 169 169 169 169 169 169 
isolates 13 9 8 6 5 3 3 3 3 0 
total edges 5382 5758 6067 6307 7064 7207 8024 8708 9517 9951 


total weight 520832 520832 625175 755072 901466 954645 1056047 1178363 1378776 1406984 
max weight 13593 14543 15577 18771 22059 26676 30882 35906 40900 43706 


components 14 10 9 7 6 5 4 4 4 1 
density 0.38 0.41 0.43 0.44 0.50 0.51 0.57 0.61 0.67 0.70 
centralization 0.54 0.54 0.52 0.53 0.47 0.47 0.425 0.36 0.32 0.30 


closed triads 105808 119228 131311 136946 178692 182199 233027 275914 333135 363250 
Table Notes: The table shows descriptive statistics of network cross-sections. Nodes is the number of countries. 
Isolates is the number that have no connections. Total edges is the sum of ties between countries. Total weight is the 
sum of edge weights between countries. Components is the number of clusters disconnected from the main cluster (1 
means all clusters are connected). Density is the proportion of observed to potential ties. Centralization is the degree 
to which ties are centered on few nodes. Closed triads is the number of closed triads in the network. 


The network becomes extremely dense by 2017, which presents computational 
challenges for inferential network models. At the same time the centralization of the network is 
declining over time as more countries are connecting to each other and not just to a few highly 


connected countries. Finally, among the 790,244 possible closed triads in the network, the triad 
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census shows an incredible number of closed triads in the network, which more than triple over 
the time period. This is probably due, at least in part, to the nature of multi-country publications, 
which by definition are represented as a set of closed triads between all countries listed. 

Before discussing the results of the STERGMs, there are a few important notes on the 
modeling approach. Because STERGMs are unable to account for edge weights in valued 
networks, four separate models are presented that binarize (0/1) the edge weights at certain cut 
points. In the first model, edge weights of zero are left zero, and edge weights at one or higher 
are set to one. This is termed the ‘full network’, as this does not affect the number of ties or the 
density in the network. This is a common strategy for addressing edge weights and density issues 
in network analysis (Cranmer and Desmarias, 2011). Next, in models two, three, and four, the 
networks are binarized at the quantiles of the natural logarithm of the edge weights (+1) 
excluding zero from the distribution. The natural logarithm was used because the edge weights 
have a highly skewed distribution. In model two, the networks are binarized above and 
below/equal to the 1* quantile. In model three, the networks are binarized above and below/equal 
to the median of the distribution. In model four, the networks are binarized above and 
below/equal to the 3“ quantile of the distribution. Binarizing in this way effectively reduces the 
density of the networks because the ties require a progressively higher edge weight to be counted 
as ties in the network. 

Recall, hypothesis one concerns the effect of democracy on the (a) formation and (b) on 
the persistence of ties, while hypothesis two concerns the effect of governance homophily on tie 
(a) formation and (b) on persistence. Table 2 shows the STERGM results of four models: 1) full 


binarized network, 2) binarized at the first quantile, 3) binarized at the median, and 4) binarized 
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at the third quantile of edge weights. Each model has two columns of results for the formation 


and persistence sides of each model.! 


Table 2: Separable Temporal Exponential Random Graph Models, 2008-2017 


1. Full Network 2. First Quantile 3. Median 4. Third Quantile 
Formation Persistence Formation Persistence Formation Persistence Formation Persistence 
edges -62.92*** -14.12*** -22.08 *** -15,68 *** -22.05°* -13.95°"* -14.77 *** 3.61 *** 
(0.27) (0.03) (0.18) (0.02) (0.77) (1.05) (1.12) (0.05) 
GWeegree [1.5/3] -9.69*** -3,.41*** -7.54 *** -1.69 *** -4.73"" -1.33"" -7.15 *** -4,53 *** 
(0.03) (0.14) (0.36) (0.15) (0.37) (0.16) (0.31) (0.25) 
GWESP [0.5/1.5] 30.97*** 1,19*** 4,73 *** 0.46 *** 139°" 0.29" 1.11 *** 0.02 
(0.24) (0.09) (0.20) (0.09) (0.24) (0.08) (0.17) (0.08) 
GWDSP [0.5/0.5] 0.19* 0.07*** 0.07 *** 0.04 *** 0.04% 0.03** 0.07 *** 0.01 * 
(0.09) (0.01) (0.00) (0.01) (0.00) (0.00) (0.00) (0.01) 
nodecov.libdem 0.45*** 0.25*** 0.38 *** 0.16 * 0.37" -0.03 0.01 -0.44 
(0.05) (0.06) (0.05) (0.08) (0.07) (0.11) (0.11) (0.23) 
absdiff.libdem -0.48*** -0.32*** -0.26 *** -0.57 *** -0.37"" -0.27° 0.02 -0.69 ** 
(0.06) (0.08) (0.07) (0.09) (0.09) (0.13) (0.13) (0.27) 
nodecov.|nauthors 0.52*** 0.56*** 0.62 *** 0.63 *** 0.76" 0.55" 0.64 *** 0.49 *** 
(0.01) (0.01) (0.01) (0.02) (0.02) (0.03) (0.03) (0.06) 
absdiff.Inauthors -0.13*** -0.06*** -0.12 *** -0.12 *** -0.11"" -0.12"™" 0.15 *** 0.10 * 
(0.01) (0.01) (0.01) (0.01) (0.01) (0.02) (0.03) (0.04) 
nodecov.urban -0.00*** -0.00 -0.00 *** -0.00 *** -0.00° -0.00 -0.01 ** 0.01 * 
(0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) 
absdiff.urban -0.00*** -0.00 0.00 0.00 * 0.00 0.00 -0.01 *** -0.00 
(0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) 
nodecov.Ingdp 0.06*** 0.04* 0.09 *** 0.17 *** 0.13°" 0.17" 0.09 * -0.29 *** 
(0.01) (0.02) (0.02) (0.02) (0.02) (0.03) (0.04) (0.06) 
absdiff.Ingdp -0.08*** -0.14*** -0.11 *** -0.13 *** -0.10°" -0.15% 0.01 -0.16 * 
(0.01) (0.02) (0.02) (0.02) (0.02) (0.03) (0.04) (0.07) 
nodecov.lnpop 0.07*** 0.02 0.05 *** 0.07 *** 0.07" 0.09" -0.04 -0.10 * 
(0.01) (0.01) (0.01) (0.01) (0.02) (0.03) (0.03) (0.04) 
absdiff.Inpop -0.02 0.00 -0.02 * -0.03 * -0.07" -0.08% -0.13 *** -0.14 ** 
(0.01) (0.01) (0.01) (0.01) (0.01) (0.02) (0.02) (0.04) 
nodematch.region 1.57*** 2.70*** 1.99 *** OY ais 2.53°" 2.46" L5L ee" 1.79 *** 
(0.04) (0.06) (0.04) (0.07) (0.06) (0.09) (0.09) (0.19) 
nodefactor.region Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed 
Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed 


x 


**p < 0.001; “p < 0.01; *p < 0.05; standard errors in parentheses; see appendix for model diagnostics; alpha and decay values in brackets. 


Nodecov is a continuous node covariate. Absdiff is the absolute difference between continuous node values (negative coefficient = homophily). 
Nodematch is the match between shared node categories. Nodefactor is a categorical node effect (positive coefficient = homophily). Edges 
represents density. GWdegree represents preferential attachment. GWESP and GWDSP represent transitivity. 


1 Asa side note, while the model terminology of the STERGM is ‘dissolution’, this is actually interpreted as 
‘persistence’ in the ‘tergm’ manual notes. Hence, the label persistence is used to avoid confusion. 
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The model term ‘nodecov.libdem’ provides support for hypothesis 1(a) across models 1- 
3, showing that the liberal democracy index has a positive significant coefficient on the 
probability of tie formation in the networks. This term is non-significant in Model 4. The results 
also show support for hypothesis 1(b), regarding the persistence side of model 1 and model 2 but 
not model 3 and model 4. The results show support for hypothesis 2(a), regarding governance 
homophily effects on tie formation. The model term ‘absdiff.libdem’ shows significant 
coefficients across models 1-3, suggesting ties tend to form between countries with similar 
liberal democracy scores (the negative sign indicates that the further away the democracy scores 
are between two countries the less probable is a tie). The term on model 4 is insignificant. Lastly, 
the tie persistence side of the models shows significant estimates across all four models, 
supporting hypothesis 2(b). 

Changes across the four models suggest that as the network is trimmed with an elevated 
criterion for the counting of ties (higher edge weight necessary for counting the tie), democracy 
becomes less important to tie formation and persistence. In other words, a portion of the 
magnitude of the estimate of democracy on tie formation and persistence appears to be accounted 
for by ties occurring between less scientifically prominent countries with weaker collaboration 
ties. The larger the sample of network ties included in the model, the greater the strength of the 
estimate on democracy. Analogously, this could also be due to a lower tie change rate for higher 
edge-weight thresholds. The endogenous controls also show interesting results. The ‘edges’ term 
show the extremely high density in the full network but lowers for higher edge trimming criteria. 
Preferential attachment (GWdegree) and transitivity (GWESP and GWDSP) also show positive 
significant coefficients across the models. For exogenous controls, the estimates on the number 


of authors, GDP, and geo-political region are mostly positive across models with consistent 
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homophily effects, while urbanization and population size have somewhat more inconsistent 
results across models. 

The STERGMs converged and produced reasonable MCMC diagnostic results. However, 
the goodness-of-fit results for the STERGMs showed a relatively poor model fit on the edge- 
wise shared partner distribution for any model specification (see Appendix for diagnostics). 
Alpha and decay parameters on GWD, GWESP, and GWDSP, were adjusted separately for the 
formation and persistence sides of the STERG model, as the persistence side appeared to show 
considerable confoundedness between centralization and clustering, and the signs on these terms 
were sensitive to changes in the decay and alpha parameters (e.g. Levy 2016). A final concern 
with these models is multicollinearity. The variance inflation test (VIF) for ERGMs, recently 
developed by Duxburry (2021) showed considerable multicollinearity in all four models. The 
third model showed multicollinearity only in the persistence side of the model. While these 
issues were not significant enough to prevent model convergence, they do point to the difficulty 
in modeling complex networks of this type. 

Next, the results of the valued exponential random graph models (VERGMs) are 
presented. VERGMs with a Poisson reference distribution would not converge with any 
specification of model options, possibly due to the extreme density in the networks and to 
overdispersion in the edge values (see Huang and Butts, 2012). To achieve convergence, the 
edge weights (+1) were first transformed using the natural logarithm then quantized at the first, 
second, third, and fourth quantiles (assessed without zeros). The discrete uniform reference 


distribution was selected to match this transformation. Three valued network controls were 
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included, ‘sum’, ‘nodesqrtcovar’, and ‘transitive.weights’.? Exogenous controls identical to 


Table 2 were included in all models. 


Table 3: Valued Exponential Random Graph Models 2008-2017 


2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 
sum -12.33*** -12.36** -10.75*** -9.93 *** -9.60 *** -9.84 *** -9.72 *** -9.75 *** -13.56 *** -8.27 *** 
(0.52) (0.52) (0.49) (0.46) (0.45) (0.43) (0.47) (0.46) (0.08) (0.45) 
nodesqrtcovar 0.70*** 0.89*** 0.73*** 0.69 *** 0.85 *** 0.83 *** 1:07 *** 1.07 *** 1.78 *** 1.34:*** 
(0.05) (0.05) (0.04) (0.04) (0.04) (0.04) (0.03) (0.03) (0.04) (0.03) 
transitive.weights 0.53 * 0.50 * 0.21 0.07 -0.07 -0.47 * -0.10 -0.37 0.04 -0.67 ** 
(0.24) (0.24) (0.22) (0.22) (0.23) (0.20) (0.24) (0.24) (0.22) (0.24) 
nodecov.libdem 0.30 *** 0.34*** 0.26*** 0.13 ** 0.15 *** 0.14 ** 0.08 0.10 * 0.28 *** 0.37 *** 
(0.05) (0.05) (0.05) (0.05) (0.04) (0.04) (0.04) (0.04) (0.04) (0.04) 
absdiff.libdem -0.32 *** -0.41 *** -0.33 *** -0.20 *** -0.25 *** -0.20 *** -0.14 * -0.10 -0.26 *** -0.21 *** 
(0.06) (0.06) (0.06) (0.06) (0.05) (0.06) (0.06) (0.06) (0.06) (0.06) 
nodecov.Inauthors 037-7** 0.35 *** 0.37 *** 0.38 *** 0.37 *** 0.38 *** 0.39 *** 0.37 *** 0.43 *** 0.38 *** 
(0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) 
absdiff.Inauthors -0.08 *** -0.10 *** -0.09 *** -0.08 *** -0.09 *** -0.09 *** -0.11 *** -0.13 *** -0.18 *** -0.12 *** 
(0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) 
nodecov.urban -0.00 -0.00 ** -0.00 ** -0.00 *** -0.00 -0.00 *** -0.00 * -0.00 * -0.00 -0.00 *** 
(0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) 
absdiff.urban 0.00 0.00 -0.00 0.00 -0.00 -0.00 0.00 * 0.00 -0.00 -0.00 
(0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) (0.00) 
nodecov.Ingdp 0.06 *** 0.08 *** 0.05 ** 0.07 *** 0.05 *** 0.10 *** 0.08 *** 0.08 *** 0.08 *** 0.06 *** 
(0.02) (0.02) (0.02) (0.02) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) 
absdiff.Ingdp -0.04 ** -0.06 *** -0.06 *** -0.09 *** -0.07 *** -0.09 *** -0.08 *** -0.09 *** -0.05 *** -0.05 *** 
(0.01) (0.02) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) 
nodecov.Inpop 011 ttt 0.11 *** 0.08 *** 0.06 *** 0.06 *** 0.05 *** 0.03 ** 0.05 *** 0.10 *** 0.01 
(0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) 
absdiff.Inpop -0.04 *** -0.04 *** -0.03 *** -0.04 *** -0.04 *** -0.05 *** -0.04 *** -0.03 *** -0.01 0.02 * 
(0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) (0.01) 
nodematch.region 1:59 *** 1.68 *** 1.50 *** 1.61 *** 1.41 *** 1.48 *** 1:50/%%* 1.39 *** 1.58 *** 1,.24:*** 
(0.04) (0.04) (0.04) (0.04) (0.04) (0.04) (0.04) (0.04) (0.04) (0.04) 
nodefactor.region Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed 
Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed Fixed 


* 


™p < 0.001; **p < 0.01; *p < 0.05; standard errors in parentheses; Nodecov is a continuous node covariate. Absdiff is the absolute difference between continuous 


node values (negative coefficient = homophily). Nodematch is the match between shared node categories (positive coefficient = homophily). Nodefactor is a 


categorical node effect. Nodesqrtcovar represents preferential attachment. Transitive.weights represents transitivity. 


The models presented in Table 3 provide consistent support from 2008 to 2017 for both 


H1c and H2c, which specify respectively that the liberal democracy index is associated with the 


2 As a side note, the inclusion of transitive.weights increased computation time from around roughly 30 minutes to 


roughly 24 hours for each model, using an i7-8750H processor with 16Gb of ram. 
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strength of international research collaboration ties and that similarity in the index is associated 
with increased strength of international collaboration ties between countries. Support for the 
hypothesized main effect of democracy, ‘nodecov.libdem’, appears to wane in the middle years 
then increase in strength toward the end. Support for the hypothesized homophily effect of 
democracy on tie strength is also supported by the negative significant coefficient on the 
‘absdiff.libdem’ term (negative is homophily; positive is heterophily). These results appear 
consistent with the formation side of the STERGMs, where there are positive estimates on 
number of authors, GDP, and population size, as well as homophilous tie strength on these terms 
and geo-political region. Similarly, urbanization showed negative though inconsistently 
significant estimates across the time period. Interestingly, the transitivity estimate in the valued 
ERG showed a gradual decrease in significance and then a reversal of sign toward the end of the 
period. It is generally unusual to see a negative transitivity estimate. Following Krivitsky (2012) 
this is perhaps due to the inclusion of actor the heterogeneity ‘nodesqrtcovar’ which appears to 
be gaining strength across the time period. It may also be due to the increasing density of the 
networks, such that the proportion of closed triads is increasing regardless of tie strength. There 
is currently no implementation of the goodness-of-fit procedure for count ERGMS. Finally, the 
VIF test showed no issues with multicollinearity in the VERGMs. 

Limitations. There are a few limitations to the current study. While the V-Dem data does 
have a very broad coverage of countries in the international system, there were at least thirty 
countries contained in the bibliometric data which were not covered. In this case, having very 
broad data coverage is important because much of the variability in tie changes over time is 
between less developed countries, and democracy effects may be underestimated. Another 


limitation is that there remain no valued network options for temporal inferential network 
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models, which required binarizing the tie values. Another limitation was the failure to find a 
model to converge on the valued network with raw edge weights using the Poisson distribution. 
This may be due to the extreme density of the networks and the wide dispersion in the edge 
weight values (e.g. Huang and Butts, 2021). There are also well-known limitations of the Web of 
Science data base used for the networks. Finally, as with all observational studies, in the absence 
of a control group and randomization, causal inference cannot be conclusively established 
regarding hypothesized effects. Hence, claims regarding the results are limited to association 
rather than causation. Future, studies might estimate similar models on sub-networks, perhaps 
comparing disciplines hypothesized as more or less affected by democratic governance. Another 
idea for future study is to connect network measures with other measures of interest such as 


scientific impact. 


DISCUSSION AND CONCLUSION 
This study sought out to test two hypotheses, each with three sub-hypotheses, using longitudinal 
data merged together from the Varieties of Democracy Institute, Scopus, Web of Science, and 
World Bank Indicators, and analyzed with two types of inferential network analysis: separable 
temporal exponential random graph models (STERGM), which leveraged the panel structure of 
the data; and valued exponential random graph models (VERGM), leveraging edge weights but 
treating each year sequentially over time. 

The first hypothesis concerned the direct effects of liberal democracy on international 
collaboration ties with uniform effects across tie formation (Hla), persistence (H1b), and 
strength (H1c). The study results supported the conclusion that liberal democracy does appear to 


be a significant driver of international collaboration tie formation and persistence. However, 
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these results appear to depend on the tie inclusion criteria, with more restrictive criteria based on 
stronger tie weights attenuating these effects. The VERGM results suggested that liberal 
democracy does appear to be a significant predictor of tie strength. Together, results provide 
empirical support for the argument that democracy and science share similar values, are 
productively compatible, and that democracy provides the context for the full development and 
flourishing of global scientific inquiry (Barber, 1953; Merton 1938, 1973; Parsons, 1951; 
Popper, 2020). The results also add to an emerging literature that broadly seeks to test the 
relationship between democracy and science empirically (Whetsell et al. 2021). Further, the 
results speak to a growing body of research on autocratization and the erosion of academic 
freedom around the world (Lithrmann and Lindberg, 2019; Boese, 2022; Berggren & Bjørnskov, 
2022). Future research should investigate what impacts rising autocratization are having on 
global science. 

The second hypothesis concerned assortative mixing effects of liberal democracy on 
international collaboration ties with uniform effects across tie formation (H2a), persistence 
(H2b), and strength (H2c). The STERGM results showed significant positive results regarding 
the hypothesized effect of governance homophily in both the formation and persistence sides of 
the model, suggesting that researchers from countries with similar levels of liberal democracy are 
more likely to form and maintain international collaboration ties. The VERGM results suggested 
that ties are more likely to form between countries of similar liberal democracy scores. These 
results speak to a stream of international relations literature that has debated whether 
democracies really do cooperate with each other more or less than countries with other types of 
governance (Kinne, 2013; Gallop, 2016). While, this literature has come to include attention to 


endogenous network effects in recent years, the current study shows that democracies indeed 
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appear to be cooperating with one another on international science, despite exogenous economic, 
geographical, and population factors, as well as endogenous network factors, such as preferential 
attachment and transitivity, which have often rendered political governance insignificant. The 
current study shows that homophily of governance structure is an important empirical predictor 
of international research collaboration. 

This study also uncovered several non-hypothesized but nevertheless interesting results. 
Economic capacity, researcher capacity, and geo-political region appear to be significant drivers 
of international collaboration ties. In addition, these factors all appear to show strong homophily 
positive estimates on tie formation. In other words, international collaboration ties are more 
likely to form, persist and strengthen between economically powerful countries, between 
countries with similar researcher capacity, and between countries in the same geo-political 
region. One methodological finding that may be of interest to future researchers is that studying 
international collaboration requires an understanding of existing network density. Currently, the 
network is extremely dense with almost all ties between countries being realized with at least one 
country-country paper tie. This presents challenges for researchers seeking to understand country 
level effects on international research since there is less tie change variability between well- 
connected countries. However, this also points to the need for greater attention to the effects of 
governance factors on science among traditionally less well-connected countries. 

Future research should further investigate the connections between factors of governance 
and processes and outcomes of international science. As the international collaboration network 
reaches near complete density, it will be necessary to investigate various sub-networks of 
disciplines selected based on hypothesized effects of governance factors. Further, newer 


measures of academic freedom have recently been developed which invites more fine-grained 
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investigation into the details of democratic effects at the institutional level. Future studies should 
link up network measures used to characterize country position with scientific outcomes such as 
novelty and impact. 

This study addresses a gap in the literature on international collaboration which has thus 
far almost completely avoided the question of comparative political governance. This is an 
unfortunate lacuna since insights about the structural antecedents of international research 
collaboration are relevant to policymakers seeking greater scientific impact on the world stage. 
While the world undergoes a third wave of democratic backsliding, the question remains open as 
to the long-term effects on international collaboration. This study shows that in fact liberal 
democracy is a significant predictor of international research collaboration. At the same time, a 
large majority of the world’s citizens live under varying degrees of autocratic rule, where 
academic freedom is undermined or non-existent. Policymakers seeking to develop advanced 
scientific systems should consider democratic reforms that promote greater development of 
scientific human capital, remove constraints from academic freedom, and provide support the 


self-organization of scientific activity. 
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Appendix — Model Diagnostics for ERGMs 


Figure A.1 — Goodness of fit plot for full binarized network STERGM 
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Figure A.2 — Goodness of fit plot for 1* quantile STERGM 


Figure A.3 — Goodness of fit plot for median STERGM 
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Figure A.4 — Goodness of fit plot for 3"! Quantile STERGM 
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